Pseudo Relevance Feedback with Deep Language Models and Dense Retrievers: Successes and Pitfalls

نویسندگان

چکیده

Pseudo Relevance Feedback (PRF) is known to improve the effectiveness of bag-of-words retrievers. At same time, deep language models have been shown outperform traditional rerankers. However, it unclear how integrate PRF directly with emergent models. This article addresses this gap by investigating methods for integrating signals rerankers and dense retrievers based on We consider text-based, vector-based hybrid approaches investigate different ways combining scoring relevance signals. An extensive empirical evaluation was conducted across four datasets two task settings (retrieval ranking). Text-based results show that use had a mixed effect datasets. found best achieved when (i) concatenating each passage query, searching new set queries, then aggregating scores; (ii) using Borda aggregate scores from runs. Vector-based enhanced over several metrics. higher query retains either majority or weight within mechanism, shallower signal (i.e., smaller number top-ranked passages) employed, rather than deeper signal. Our method computationally efficient; thus, represents general others can

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Cross-Language Pseudo-Relevance Feedback Techniques for Informal Text

Previous work has shown that pseudo relevance feedback (PRF) can be effective for cross-lingual information retrieval (CLIR). This research was primarily based on corpora such as news articles that are written using relatively formal language. In this paper, we revisit the problem of CLIR with a focus on the problems that arise with informal text, such as blogs and forums. To address the proble...

متن کامل

Multimedia Search with Pseudo-relevance Feedback

We present an algorithm for video retrieval that fuses the decisions of multiple retrieval agents in both text and image modalities. While the normalization and combination of evidence is novel, this paper emphasizes the successful use of negative pseudo-relevance feedback to improve image retrieval performance. While the results are still far from perrfect, pseudo-relevance feedback shows grea...

متن کامل

Axiomatic Analysis of Smoothing Methods in Language Models for Pseudo-relevance Feedback by Hussein Hazimeh Thesis

Pseudo-Relevance Feedback (PRF) is an important general technique for improving retrieval effectiveness without requiring any user effort. Several state-of-the-art PRF models are based on the language modeling approach where a query language model is learned based on feedback documents. In all these models, feedback documents are represented with unigram language models smoothed with a collecti...

متن کامل

Extract-biased pseudo-relevance feedback

Successfully retrieving a web document is a twofold problem: having an adequate query that can usefully and properly help filtering relevant documents from huge collections, and presenting the user those that may indeed fulfill his/her needs. In this paper, we focus on the first issue – the problem of having a misleading user query. The aim of the work is to refine a query by using extracts ins...

متن کامل

Structure Cognizant Pseudo Relevance Feedback

We propose a structure cognizant framework for pseudo relevance feedback (PRF). This has an application, for example, in selecting expansion terms for general search from subsets such as Wikipedia, wherein documents typically have a minimally fixed set of fields, viz., Title, Body, Infobox and Categories. In existing approaches to PRF based expansion, weights of expansion terms do not depend on...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: ACM Transactions on Information Systems

سال: 2023

ISSN: ['1558-1152', '1558-2868', '1046-8188', '0734-2047']

DOI: https://doi.org/10.1145/3570724